Learning to Detect and Track People in RGBD Data
نویسندگان
چکیده
Introduction People detection and tracking is an important and fundamental component for many robots, interactive systems and intelligent vehicles. Previous works have used cameras and 2D and 3D range finders for this task. In this paper, we present a 3D people detection and tracking approach using RGB-D data. Given the richness of the data, we learn target appearance models for the purpose of improved detection and data association. This is a new aspect for range-based target tracking which usually deals with objects of identical appearance. To this end, we take an online boosting approach similar to [2] and learn a strong target classifier based on three types of RGB-D features. This results in an on-line people detector which is combined with a novel people detector based on a generic learned model, both integrated into a multi-hypothesis tracking system. The new generic detector, called Combo-HOD, fuses the image and depth information in the data. It is composed of the HOG detector (Histogram of Oriented Gradients) [1] applied on the RGB image and a newly introduced HOD approach (Histogram of Oriented Depths), inspired from the former, applied on the depth image. HOD locally encodes the direction of depth changes and relies on an depth-informed scale-space search that leads to a 3-fold acceleration of the detection process. Combo-HOD is general in that it neither relies on background learning nor on a ground plane assumption. For the evaluation we collect RGB-D data in a populated indoor environment with a setup of three Microsoft Kinect sensors with a joint field of view. The experiments demonstrate reliable 3D detection and tracking of people in RGB-D data up to 8m from the sensor and further show how the on-line detector improves the overall tracking performance. This paper advances the state-of-the-art in the following aspects. First, we address the novel problem of detecting people in RGB-D data in distances far beyond the recommended sensor operating range (called adequate play space, see Fig. 2), second, we perform tracking of people in 3D data with a multi-hypothesis tracker (MHT), and third, we propose an online-learning method of target appearances along with its integration into the MHT.
منابع مشابه
A Software Architecture for RGB-D People Tracking Based on ROS Framework for a Mobile Robot
This paper describes the software architecture of a distributed multi-people tracking algorithm for mobile platforms equipped with a RGBD sensor. Our approach features an efficient point cloud depth-based clustering, an HOG-like classification to robustly initialize a person tracking and a person classifier with online learning to drive data association. We explain in details how ROS functional...
متن کاملNavigation of a Mobile Robot Using Virtual Potential Field and Artificial Neural Network
Mobile robot navigation is one of the basic problems in robotics. In this paper, a new approach is proposed for autonomous mobile robot navigation in an unknown environment. The proposed approach is based on learning virtual parallel paths that propel the mobile robot toward the track using a multi-layer, feed-forward neural network. For training, a human operator navigates the mobile robot in ...
متن کاملA Human Activity Recognition System Using Skeleton Data from RGBD Sensors
The aim of Active and Assisted Living is to develop tools to promote the ageing in place of elderly people, and human activity recognition algorithms can help to monitor aged people in home environments. Different types of sensors can be used to address this task and the RGBD sensors, especially the ones used for gaming, are cost-effective and provide much information about the environment. Thi...
متن کاملHuman Activity Detection from RGBD Images
Being able to detect and recognize human activities is important for making personal assistant robots useful in performing assistive tasks. The challenge is to develop a system that is low-cost, reliable in unstructured home settings, and also straightforward to use. In this paper, we use a RGBD sensor (Microsoft Kinect) as the input sensor, and present learning algorithms to infer the activiti...
متن کاملHuman Activity Detection from RGBD Image
Being able to detect and recognize human activities is important for making personal assistant robots useful in performing assistive tasks. The challenge is to develop a system that is low-cost, reliable in unstructured home settings, and also straightforward to use. In this paper, we use a RGBD sensor (Microsoft Kinect) as the input sensor, and present learning algorithms to infer the activiti...
متن کامل